Resilience of Complex Networks to Random Breakdown 



in 
o 
o 

(N 



Gerald Paul,^'Q Sameet Sreenivasan/ and H. Eugene Stanley^ 
^ Center for Polymer Studies and Dept. of Physics, Boston University 

Using Monte Carlo simulations we calculate /c, the fraction of nodes which are randomly removed 
before global connectivity is lost, for networks with scale-free and bimodal degree distributions. Our 
results differ with the results predicted by an equation for fc proposed by Cohen, et al. We discuss 
the reasons for this disagreement and clarify the domain for which the proposed equation is valid. 
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I. INTRODUCTION 

It has been shown 0, S 0, B that random uncorre- 
lated networks with degree distribution P{k) lose global 
connectivity when 



- (fc) 



< 2. 



(1) 



As explained in Ref. 0, 13 1 random removal of a fraction 
/ of nodes from a network with degree distribution Po{k) 
results in a new degree distribution 



P(fc)= f]Po(fco)Q°) {l-f)'f°-'. 

k„=k ^ ^ 



(2) 



Using this degree distribution to calculate (fc) and (fc 



after 
that 



random removal of sites it was determined 



/c = l- 



KO - 1 



(3) 



where kq is the value of k computed from the original de- 
gree distribution, before the random removal. Equation 
was observed to hold for a number of network types, 
including random networks that have a Poisson degree 
distribution, and was used in the analysis of scale-free 
networks that have power-law degree distributions 0, Q . 

Using Monte-Carlo simulations we find that Eq. © 
does not hold for networks with (i) self-loops and multiple 
edges and/or (ii) high variance in fc. We illustrate our 
findings using scale- free and bimodal networks and clarify 
the domains where Eq. Q is valid. 



II. MONTE CARLO SIMULATIONS 

We create random networks having specified degree 
distributions using the method described in Ref. l| • We 
then randomly delete nodes in the network and after each 
node is removed, we calculate k. When k becomes less 



than 2 we record the number of nodes i removed up to 
that point. This process is performed for many realiza- 
tions of random graphs with a specified degree distribu- 
tion and, for each graph, for many different realizations 
of the sequence of random node removals. The threshold 
fc is defined as 



fc 



N 



(4) 



where (i) is the average value of i. 



III. SCALE-FREE NETWORKS 

We study scale-free random networks with degree dis- 
tribution 



P(fc) 



[m < fc < K]. 



(5) 



We choose the lower cutoff m — A and the upper cutoff 
K — N. In Figs, ^a), (b) and (c), we show the de- 
pendence on A of 1 — obtained by the Monte Carlo 
simulations and compare it with 1 — obtained theo- 
retically from Eq. The simulation results agree well 
with Eq. for A > A*, where A* « 3, and the agreement 
becomes better for increasing TV. However, for A < A* 
there is significant disagreement, and the disagreement 
becomes larger as N increases, as seen clearly Fig. ^ (d) 
in which we plot the normalized difference 



MC 



MC 



(6) 
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The nonzero value of A has its root in the use of Eq. (j^J 
to derive Eq. (|3J). Equation ^ is valid only if, in the 
original network, two conditions hold: (i) There are no 
self loops, i.e. all links from node i are to distinct nodes 
j with j ^ i and (ii) there are no multiple links between 
i and j. In graph theory networks satisfying these two 
conditions are called simple. If the original network is 
not simple, Eq. (01 must then be interpreted as operating 
on the original network but with self-loops and multiple 
links deleted. But this deletion changes the properties of 
the degree distribution. As seen in Figs. |21(a), (b), and 
(c) the cutoff is changed, and for large N, the slope of 
the tail of the distribution is modified. Also the degrees 
of adjacent nodes become correlated as seen in Fig. O 



2 



which shows the A-dependence of the degree correlation 



i 



(7) 



Here Cjk is the joint probabihty of the remaining degrees 
[TT| of the two vertices at either end of a randomly chosen 
edge, qk is the probability of the remaining degree of a 
single vertex at the end of a randomly chosen edge, and 



(8) 



Because of the degree correlations, Eq. no longer ap- 
plies and therefore Eq. (PJ no longer holds. The similarity 
in appearance between Fig.QJd) and Fig.Oconfirms that 
the nonzero correlations play a major role in the differ- 
ence between and f^^. 

We can explain the domain of validity of Eq. ^ as 
follows. It is known @,0ilEI3 that for any desired ran- 
dom degree distribution, the networks created by such 
methods as those of Molloy-Reed 1] or Chun-Lu 6] cre- 
ate simple graphs only if P{k) ~ for k greater than the 
structural cutoff 



probability that almost all the leaf nodes are removed be- 
fore the hub node is removed approaches for large N . 
Let i be the number of nodes which are removed before 
the hub node is removed. Since the removal is random, i 
is uniformly distributed between and iV — 1 and, from 
Eq. igj, fc = 1/2. On the other hand, Eq. ||2Jl predicts 
/c = 1 — 2 /N which asymptotically approaches unity for 
large TV. 

As for the case of scale-free networks, we can under- 
stand this disagreement as a result of the presence of self 
loops. We can also use this star network example to iden- 
tify another implicit assumption used in the derivation of 
Eq. ((2Jl, namely that 



(^)^((^|^.(^) = 2)) = (^|M^))=2) 



(12) 



where k{i) is the value of k after the removal of i nodes. 
That is, we define (i) to be the average of i such that in 
each random removal = 2; the derivation of Eq. Q 
assumes that {%) is equal to i such that the average of 
over all random removals equals 2. Equation H12|) will 
be true in the limit in which the variance ((i — is 
zero. But when the variance becomes large as is the case 
for the star network, Eq. H12|l may be not hold. FigureQ] 
illustrates graphically an example for which Eq. 1)12(1 does 
not hold because the variance in i is large. 



(9) 



It is also known that for scale-free networks the number 
of nodes with degree greater than the natural cutoff 



(10) 



is statistically insignificant 0, ^3 ■ These two facts are 
sufficient to understand that Eq. © is valid for scale-free 
networks only if A > 3 (in which case the natural cutoff 
Kc results in nodes with degree > -y/iV being statistically 
insignificant) or for A < 3 if the maximum degree is less 
than the structural cutoff K^. 



IV. BIMODAL NETWORKS 



B. Other Bimodal Networks 

In order to study other bimodal networks, we extend 
the star network to networks with q high degree hubs 
connected to the remaining nodes of degree one. For 
networks with average degree (fc), the degree distribution 
is specified as 



P{k) = 



{N~q)/N [fc = l] 



q/N 



[fc = fca] 



where 



{{k)-l)N + q 



(13) 



(14) 



A. Star Networks 

First, we discuss a simple example with a bimodal de- 
gree distribution for which Eq. fails. Consider a star 
network of N nodes with degree distribution 



P(fc) 



{N-l)/N [fc = l] 



1/iV 



[fc = iV - 1] 



(11) 



and P{k) = for all other values of fc. If nodes are 
randomly removed, the criterion for losing global con- 
nectivity, K < 2, is obtained when the single node with 
degree A'^— 1, the hub node, is removed or when almost all 
of the degree 1 nodes, the leaf nodes, are removed. The 



and P(k) = for all other fc. We first consider net- 
works with (fc) = 2. In Fig. EJa), for the distribution 
of Eqs. H13|) and H14|) . we plot 1 — /c as a function of q 
for N = 10^, 10^ 10^, and 10^ Also shown in Fig.Efa) 
are plots for approximations /^'^^ and /^°™ which we ex- 
pect to be valid respectively for high and low values of 
q. We will use these approximations to determine how 
fc{q) scales and for which values of q Eq. (PJ is valid. 
The approximations are determined as follows: 

(i) When q ^ N, (i.e., the network is homogeneous) 
we expect Eq. ® to hold so /^^'s*^ = 1 - 1/(ko - 1). 

(ii) For small q, the network loses global connectivity 
when all q high degree nodes are removed. The 



3 



probability that all q high degree nodes are removed 
after the first i nodes of all types have been removed 



N 1^- 

\ q- 



(15) 



Here i is now the average number of nodes that must 
be removed before all q high degree nodes are removed. 



Then (i) = YJ^=qig{q,N,i) and 



How ^ ill ^ 



N 



(16) 



Note that /'°'^ does not depend on (fc) since changing 
{k) results simply in a different number of links between 
the high degree nodes; if our criterion for collapse is the 
removal of all high degree nodes, the number of links 
between them is irrelevant. As expected, the plots of 
/^°^ and Z^'^*^ approximate the values of /c for low and 
high values of q, respectively. 

In Fig. |S{b), we plot the the number of hubs, q* , for 
which the functions fl°'"{q) and fc^^^il) intersect. We 
find that 



q* - N' 



0.5 



(17) 



Similar plots (see Fig. |HJl for (fc) = 3 and (k) ~ 4 also 
exhibit scaling of q* as N^^^ with only a change in the 
prefactor; the scaling is independent of (k). 

The simulation results suggest that q* scales as v^. 
We can show this to be the case by solving analytically 
for q* for large N as follows: For general (fc), using the 
distribution in Eqs. H13|l . we find for q ^ 1 



(18) 



For /'°^, the sum in Eq. H16() can be performed analyti- 
cally, yielding 

fr - + (19) 



NT{N +l)r{q + 2) 



for g > 0. For large N 



r(g + 2)-r(g+l) 



riq + 2) 

To first order in l/q, Eq. ^ yields 



/r = i-i + o(l; 



Equating Eqs. (HU and wc find 



(20) 



(21) 



(22) 



consistent with the plot in Fig. El^b) and Eq. (|17ll . From 
the fact that q* scales like \/N, we conclude that all 



characteristic values of fc scale like v N with a prefactor 
dependent on (k). In particular the value of q at which 
(found from Monte Carlo simulations) agrees to any 
desired degree with the value of f^^ (from Eq. |(2Jl) will 
scale with N in the same fashion in which q* scales with 
N, Eq. (|17|l . For simplicity, we consider Eq.(3) to be 
valid for q > q* . 

We now confirm that the variance in fc is in fact small 
for values of q for which Eq. (fT^ holds. In Fig. [7{a), for 
N = 10^ and g = 1, 5, 10, and 20, we plot P(l - fc) vs. 
1 — fc- As expected, for g = 1 (star network) the distribu- 
tion is uniform because there is an equal probability that 
the single high degree node will be removed at any value 
of i. For the larger values of q, the distributions P{1 — fc) 
develop a well-defined peak. To quantify the definition of 
these peaks, we plot in Fig. [7Jb), the standard deviation 

Of/c 



N 



(23) 



versus g for A^ = 10^ 10^, 10"*, and 10^. Each of the 
plots has a large deviation at g = 1 and decrease to a 
local minimum, the position of which q increases with 
increasing N . For q greater than the q, the deviation is 
small and decreases with increasing A^. In Fig. \^h) we 
plot g as a function of A^. We see that the values of these 
minima are essentially the same as the values of q*, the 
value of q above which Eq. is valid. This is consistent 
with our understanding that Eq. (jJl is valid when the 
variance is small. 



C. Domain of Validity 

Since q and the degree of the hubs fc2 are related by 
Eq. (|14|l , we can determine for what values of k2 Eq. H12() 
is vahd. Substituting Eq. if^ in Eq. l(T^ we find that 
Eq. l(T^ is valid when 



fc2 < Viik) - l)N. 



(24) 



Thus the criterion for Eq. (|12|l holding is essentially the 
same as the criterion discussed in Sec. Illll for the graph 
being simple. The bimodal networks we study here in 
which a relatively small number of nodes control the 
global connectivity of the network yield large variances 
in fc for networks with a given number of nodes; in fact, 
for q ~ 1 the worst case variance is obtained. This sug- 
gests that the criterion of Eq. (|24|l may hold for all degree 
distributions as a requirement for a low variance in fc. If 
this is the case, we can use the requirement that P{k) — 
for k < Kc as the criterion for both the network being 
simple and fc having a small variance. Note, however, 
that while the criteria are similar, it is not true that the 
presence of self-loops and multiple edges implies that the 
distribution of fc has a large variance; for example, the 
variance of fc in scale-free networks is small even in the 
presence of self-loops and multiple edges, as seen in Fig. 
il 
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V. DISCUSSION AND SUMMARY 

We have clarified the domain of validity of Eq. Q, a 
general equation for determining /c, the fraction of nodes 
which must be randomly removed before global connec- 
tivity is lost. For Eq. ^ to be valid, (i) the highest degree 
of any nodes present in statistically significant numbers 
in a random network must be less than the structural 
cutoff Ks = y/{k)N and (ii) the variance of fc must be 
small. For bimodal networks the variance in fc is small 
when the hubs have degree less than ^ ((fc) — l)iV. That 



the bimodal networks we have studied represent a worst 
case for large variance suggests that in general the crite- 
rion that the network be simple is sufficient for Eq. I|12|l 
to hold. It is not clear if there is a deeper connection 
between these two criteria both of which scale as y/N. 
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FIG. 1: For iV = 10^ 10^, and 10* respectively in (a), (b) 
and (c), 1 — fc versus A. The solid line represents the results 
of Monte-Carlo simulations; the dashed line is the prediction 
of Eq. J^J. (d) The difference A (see Eq. between the 
prediction of Eq. ^ and Monte-Carlo simulations for (from 
top to bottom) AT = 10^ 10^, lO". Note that if we had used 
a larger value of the upper cutoff K, then A would decrease 
monotonically from A = 3 to A = 1 instead of having a mini- 
mum near A = 2. 
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FIG. 2: P{k) versus fc for iV = 10^ 10^ IQ-* in (a),(b) and 
(c) respectively. The solid line represents P{k) after network 
construction using the MoUoy-Reed method; the dashed line 
is the distribution after the removal of self-loops and multiple 
edges. 
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FIG. 3: Correlation r as a function of A for (from top to 
bottom at left) A*' = 10^, 10^, and 10"* for distributions after 
removal of self- loops and multiple edges. Note that the cor- 
relation increases with A'' for A < 3 and decreases with A'^ for 
A>3 




FIG. 4: Example illustrating case in which = 2) 7^ 

— 2) for star network of 1 hub of degree 99 and 99 
nodes of degree 1. Thin lines are n vs i, where i denotes the 
number of the step at which a node is deleted, for cases in 
which the hub is deleted at step (from left to right) 1, 10, 
20, 30, 40, 50, 60, 70, 80, 90 and 100. The thick line is the 
average of the thin lines. Note that the value of i at which 
the average crosses the horizontal line «: = 2 is much higher 
than 50, the average of the values of i at which the thin lines 
cross the horizontal line k — 2. 




FIG. 5: For (fc) = 2 and for (from left to right) N = 
10^ 10^ 10* and 10^ (a) 1 - fc vs. number of hubs q. The 
sohd hues represent Monte-Carlo simulation results. Dashed 
lines(short) are approximation fl"""; dashed lines(long) are 
approximation fc^^^. (b)Number of hubs, q versus A^. 
Squares represent characteristic values q* at which high and 
low q approximations intersect. Triangles represent values of 
q at which the standard deviation in 1 — /c is minimal. 




FIG. 6: (a) Number of hubs, q* , at which approximations for 
low and high q intersect vs. A^. Squares, triangles and circles 
represent networks with (fc) = 2, 3, and 4 respectively. 
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FIG. 7: (a) P(l - fc) the probability distribution of 1 - 
for = 10^ and q — 1 (dashed hne) and (from left to right in 
order of increasing position of peaks) q — 5, 10, and 20. (b) 
Standard deviation a versus q for N = 10^ 10^ 10* and 10^ 
(from left to right in order of increasing length of the tails 
of the distributions). Note that the second peak in this plot 
which is most pronounced for smaller is an artifact of finite 
size 
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FIG. 8: For random scale-free networks with 4 < k < N , 
standard deviation ct/c versus A for AT = 10^ 10^ 10" and 10^ 
(from top to bottom). 



